
EuropSlsches Patentamt 
European Patent Office 
Office europeen des brevets 




(u) Publication number : 0 459 793 A1 



EUROPEAN PATENT APPLICATION 



(g) Application number : 91304880.7 
(§> Date of filing : 3a 05.91 



(§> int Cl. 6 : G06K 19/08, G06K 1/12, 
B42D 15/00 



(So) Priority : 30.05.90 US 530753 

@ Date of publication of application : 
04.12.91 Bulletin 91/49 



© Designated Contracting States : 
DE FR GB 



@ Applicant : XEROX CORPORATION 
Xerox Square - 020 
Rochester New York 14644 (US) 



(7§) Inventor : Johnson, Walter A.L. 
1345 Blackfiold Drive 
Santa Clara CA. 94051 (US) 
Inventor : Faleta, Baldo A. 
P.O. Box 4564 
Stanford, CA 94305 (US) 
Inventor : Smith III, Eroi Z. 
924 Los Robles 
Palo AJto, CA 94306 (US) 

@ Representative : WeatheraJd, Keith Baynes et 

Rank Xerox Patent Department Albion House. 
55 New Oxford Street 
London WC1A 1BS (GB) 



@ Document processing. 



) A form (10) including user-modifiable fields (18) and an encoded description (26) of the location, size, 
type, etc. of the fields allows direct programming of a form interpreter. Other information, including the 
processing of the form, encoded data, etc., may be Included in the encoded information. A system for 
creating forms carrying an encoded description of selected attributes of the fields includes means for 
selecting or creating fields and locating the fields on a form whle generating, substantially simul- 
taneously, the encoded description of the selected attributes. A form composer then allows merging of 
the form and its encoded description for printing or electronic transmission. A system for reading such 
forms includes a scanner, decoding device, and processor. By reading such forms, data may be entered 
into or recalled from a data-processing system, or a form interpreter may be programmed locally or 
remotely, for subsequent handling of forms. 1 
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This present invention relates generally to docu- 
ment processing, and more specifically to documents 
having printed thereon indica of the position, size, 
type of fields which may contain data to be extracted 
from the document, and systems for producing and s 
utilizing same. 

Machine-readable forms have been in common 
use some time. Such forms provide a mechanism for 
enabling actions to be taken based on marks on a 
paper without requiring human intervention such as 10 
reading or interpreting the forms. The marks on such 
forms are extracted under the control of a device com- 
monly referred to as a form interpreter. The forms are 
"read," most often optically by a scanner of the like, 
and the forms interpreter then locates and 15 
charaterizes the marks on the forms, and may output 
control signals as a function of the presence, location, 
nature, etc, of the marks to peripheral devices. 

A "form* of the type discussed above is defined 
for the purposes of the present invention as either a 20 
tangible printed document or a data structure repr- 
senting such a document The form may contain reg- 
ions of arbitrary text arbitrary graphics, and fields. 
"Fields," as used herein, is taken to mean regions of 
the form, either physical regions of the printed docu- 25 
ment or structured regions of the data structure rep- 
resenting the printed document which are to be 
modified by a user. As used herein, a "user" may be 
either human or machine. Further, as used herein, 
"modify* shall be taken to mean-enter, add, delete, 30 
change, alter, connect disconnect highlight fill-in, 
erase, str&e-out when referring to a field. Examples 
of fields include "chek box" fields (also called "bub- 
bles"), alpha or alpha-numeric fields, image fields, 
etc. A form will often also "include a reference point 35 
indication from which the location of the fields may be 
measured. 

Information earned by forms can conveniently be 
divided into three categories: data, machine instruc- 
tions, and other information. As used above, data is ao 
taken to mean information carried by the form to be 
read or extracted from the form for processing. Exam- 
ples of data include blank or fBled-in bubbles on a 
standardized examination answer form, payee fields 
on checks which are parsed for processing, etc. 45 
Machine instruction as used above refers to infor- 
mation carried by a form which is interpreted by the 
forms interpreter and which causes acton either by 
the forms interpreter or by a remote device. Examples 
of machine instructions include information located on 50 
a form which, when read, cause data to be copied to 
or form memory locations of a computer, cause a 
mathematical or logical procedure to be applied to 
particular data, etc. Other information, as used above, 
refers generally to information ignored by the form $5 
interpreter, such as the arbitrary text and graphics 
mentioned above, prompts or instructions on the form 
to aid the user in filling in fields, information for the 



user's interest ornamental treatment etc. The first two 
categories of information, data and machine instruc- 
tions, are of interest herein. 

Forms carrying data are the most common type of 
form, and examples may be readiy found. For 
example, US-A-4,634.148 teaches a form which is a 
draft check carrying data in the form of payee, 
amount and maker. The fields carrying the data are 
located and the data is extracted for processing 
according to a preprogrammed scheme. Forms carry- 
ing instructions interpreted and used by machines are 
also known, for example from US-A-4,757,348, which 
discloses an electronic reprographics/printing system 
which uses printed control forms, called separators, to 
segregate groups of documents from one another and 
to input control or programming instructions for pro- 
cessing the documents associated with each control 
form. In fact forms carrying data as well as machine 
instructions are known, for example as taught by US- 
A-4,494,862 which describes a system wherein a 
form is given a bar code which, when interpreted by 
the forms interpreter section of the system, causes 
the system to read and print only those rows on the 
form marked with a special pen (see, e.g., col. 8, lines 
32 et seq.) 

Another reference of interest is US-A-4,728,984 
which relates to a system including an electronic 
printer for recording digital data on plain paper, 
together with the use of an input scanner for scanning 
digital data that have been recording on such a record 
medium to upload data into an appropriate device 
such as a computer or the printer itself. The appli- 
cations of the system of this reference, however, are 
limited to decoding secret documents and inputting 
program information into a computer. 

Forms of the data-carrying type may in feet carry 
several different types of user-applied data. For 
example, the above mentioned bubbles on a standar- 
dized examination answer form, and the payee fields 
on checks which are parsed for processing, represent 
two different types of data. In general, the data types 
are: digitally-coded data, for example the fBled-in or 
not fffl ed-in state of a bubble; data for character rec- 
ognition, such as bar codes, alpha-numerical data for 
optical character recognition (OCR) and the like; and 
data for image-wise handling, such as the payee field 
mentioned above, graphics and the like. 

It is important for a practical form-using system to 
be able to distinguish between the various data types. 
One method for doing so is disclosed in US-A- 
4,588,21 1 which discloses a ma chine-read able docu- 
ment having fields identified by a coating of 
fluorescent ink. Data are written into the fields by the 
user on top of the fluorescent ink such that when the 
fields are illuminated by a proper light source, the writ- 
ten data will be black in sharp contrast to the fields. 
The fields include a binary coding which is applied by 
selectively blanking out regions of the fluorescent ink 
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at the border of the fields, or regions of fluorescent Ink 
remote from but logically associated with the field, for 
example as shown in Fig. 5, and discussed at col. 7, 
line 14, through col. 8, line 6. The system dteting^ 
uishes between the various data types by using the 
coding to cause different fields to be copied to diffe- 
rent locations for printing. 

Although machine processing of forms results in 
high speed and accuracy of processing, the systems 
disclosed in the previously mentioned references 
have several important limitations. These limitations 
have, inter alia, forced the use of machine-read forms 
to be practical only when large numbers of identical 
forms are used, limited the organization and aesthe- 
tics of forms, and complicated the form creation pro- 
cess. 

First, the form interpreter must be preprogram- 
med with a description of the form in order to locate 
the form fields. In most cases, a description of the 
physical location of the fields or parts of the fields rela- 
tive to a reference point must be preprogrammed. 
This preprogramming requires substantial time, effort 
and training, and most often is performed by an 
operator different from the person making up the form 
itself. Genericaly, there is a presently unf flled need in 
the art for a form, electronic or paper, which may be 
interpreted by a general form parser that has no pre- 
vious knowledge of the form. 

A second limitation is that any encoded instruc- 
tions relative to specific data must either be physically 
part of the data field or otherwise physically or logi- 
cally associated with the data field, ft is desirable for 
form organization and aesthetics to be able to locate 
instructions (as well as other relevant information 
about form content and structure) at any arbitrary 
position a form designer chooses. 

A third limitation is that a form designer has had 
to create fields separately then add supplemental 
information, such as coded field type, if coded infor- 
mation about the fields is to be carried by the form. It 
is desirable to allow simultaneous creation of a form 
and creation of a coded description of the form. That 
is, a form creation system is needed that allows a form 
designer to create a form such that the system keeps 
track of the position, type, etc. of fields of the form, and 
automatically includes the coded description in the 
form's data structure and/or automatically prints the 
coded information on a printed copy of the form 
together with the alpha-numerics, graphics and other 
elements of the form. 

A fourth limitation is that present form interpreters 
require manual preprogramming prior to form inter- 
pretation. However, a form interpreter system is des- 
ired which works with coded forms to read from the 
form instructions on extraction and handling of the 
data and other information the form carries. 

Related to this limitation is another limitation - it 
has heretofore not been possible to program a form 



interpreter remotely- That is, at present, to program a 
form interpreter, the programmer must have direct 
physical access to the workstation, personal com- 
puter, or the like f which controls the interpreter. A 
5 method and apparatus is desired for programming the 
interpreter remotely, say via a paper form, transmitted 
to the interpreter by facsimie, via communication 
directly from the work station or personal computer 
the form resides on, etc. 
10 Yet another limitation is that the above described 

art is not capable of designating a field as more than 
one data type. That is, it has heretofore not been 
possible to designate the contents of a field for a var- 
iety of d ifferent processing. The ability to so designate 
15 the contents of a field for multiple processing avoids 
the need for rescanning, saving both processing time 
and computer memory space. 

The present invention encompasses a form, and 
systems for creating and interpreting such a form. The 
20 form itself, whether represented electronically or prin- 
ted in hard copy, carries an encoded description of 
itself. The form may include one or more of a variety 
of types of fields, as well as other non-field infor- 
mation. The form generation portion of the system 
25 automatically encodes information about the fields as 
the form is being created, and integrates that encoded 
information into the electronic and printed represen- 
tations of the form. The forms interpreter portion erf the 
system may then read the form's field description from 
30 the form itself and. based on this description, interpret 
the form. By locating encoded information about form 
fields directly on the form, the form interpreter may be 
quickly, simply and automatically programmed for 
that particular form. Thus, it becomes practical to use 
35 specialized forms. The programming of the interpreter 
by the encoded form information may be by scanner 
or, remotely, by facsimile. The system also is capable 
of allowing direct station to station communication of 
the forms via facsimile or network communication pro- 
40 tocds. Furthermore, the resulting forms, when in hard 
copy format, may be reproduced using standard office 
duplicating equipment 
The system includes: 

- A forms description language; 

45 - A computer-based editor for creating and print- 

ing forms; 

- Computer software to convert a facsimile input 
into an image file, and 

- Computer software to process the image file, 
so either from facsimile or from a scanner, so as to 

retrieve the form's description and interpret the 

form. 

The form may be structured in virtually any man- 
ner, and include arbitrary text, arbitrary graphics, and 
55 a variety of fields such as check boxes, alpha-nume- 
ric, and/or image fields among others. In addition, the 
form wOl include encoded information. In the hard 
copy representation of the form, this encoded infor- 
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mation may be printed on the face of the form. In the 
electronic representation of the form, the encoded 
information wBI be part of the data structure of the 
form. The encoded information w8l either be in a pre- 
defined location, so that the interpreter may be prog- s 
rammed to look for it in that location, or the form may 
be searched for its location. In either case, the form 
wffl also include a reference point from which the 
form's layout is calculated. 

The nature of the particular coding scheme is not 10 
important to the present invention, other than that 
whatever scheme is used, it must be able to encode 
the information compactly enough that room is not dis- 
placed which is needed for the fields themselves. 
Examples of acceptable coding schemes include bar- is 
codes, morphological glyphs, etc 

The system is composed of several components 
including a computer-based form design tool and 
interpreter. The form design tod allows the user to 
choose or create a variety of fields, such as check 20 
boxes, alpha-numeric or graphic fields, eta When the 
user creates a form, the design tool automatically 
creates a description of the form. When the user 
selects and locates a field on the form, the form design 
tool incorporates a description of the field, including 25 
the type or types of field and its location, into the form 
description. When the form is printed, an encoded 
description is automatically printed on the form. The 
user may supplement the type and location infor- 
mation with other information, such as an operation to 30 
be performed on the contents of a field, specific inter- 
relationships between contents of two or more fields, 
other action to be taken, etc. This supplemental infor- 
mation also becomes part of the form description 
which is encoded by the design tool. 35 

The interpreter wil typically be embedded in 
software. It wil accept information from a scanner and 
be capable of culling from the scanner information the 
encoded information representing the form descrip- 
tion. The interpreter will interpret the encoded infor- 40 
mation and perform either preprogrammed operations 
on the information located in specified fields or of a 
specified data type, or perform operations on that 
information from instructions encoded with the form 
description itself. 45 

Distinguishing between the various data types 
and machine instructions by indications carried by the 
form itself are of particular interest herein. A form vir- 
tuaHy complete unto itself (Le.. carrying data and 
instructions relating to use data) conserves valuable so 
memory space on computers used in a form-handling 
system, reduces programming of the form interpreter, 
allows one form to be read on a variety of different 
machines or by a variety of different form-handling 
systems, etc. Thus, the present invention provides a 55 
complete and convenient form processing system, 
overcoming the limitations of the prior art. 

The present invention win now be described by 



way of example with reference to the accompanying 
drawings, in which: 

Fig. 1 shows one embodiment of a printed form 

according to the present invention; 

Fig. 2 shows one embodiment of a data structure 

representing a form according to the present 

invention; 

Fig. 3 is a flow chart of one embodiment of a sys- 
tem for processing a form carrying a coded des- 
cription of the location, type, etc., of 
user-modifiable fields; 

Fig. 4 shows one embodiment of a system for 
generation of a form carrying a coded description 
of the location, type, etc. v of user-mod rftable fields 
according to the present invention, and 
Fig. 5 is a flow chart representation of a process 
for creating a form carrying a coded description of 
the location, type, etc., of user-modifiable fields. 
Turning now to Fig. 1, a blank form 10 according 
to one embodiment of the present invention, is shown. 
The format of the Blustrative form is a printed paper 
document, the paper itself being a common example 
of generic "carrier means," to which there may be 
imparted marking or "indicia" of various types. Form 
1 0 is of the type to which a user may impart markings, 
which markings may be read or sensed by a machine 
such as an optical scanner. The markings imparted by 
the user represent data, often of the type to be stored 
in a digital computer memory or the Ike. The markings 
imparted by the user need to be located at specific 
locations on the form so that the form interpreter may 
be instructed where to look for the data to be read. 
Thus, some aid needs to be provided to the user to 
facilitate proper location of the markings. To this end, 
form 10 wfll generally include printing prior to use, 
such as an outline of field regions to be filled in, bor- 
ders of image fields, etc. To further assist a user in 
completing a form, printed instructions are often pre- 
printed on the form itself. 

The preprinting on form 10 may include arbitrary 
text 12, such as document or field titles, the above- 
mentioned instructions, and the lice, and arbitrary 
graphics 14 such as graphical symbols, the above- 
mentioned field outlines, etc. The length, size, posi- 
tion, content, and other details of the arbitrary text 12 
and arbitrary graphics 14 do not impact the nature of 
the present invention. In fact, a form interpreter utiliz- 
ing a form such as that described herein wifl ignore the 
arbitrary text 12 and arbitrary graphics 14 in favor of 
the contents of certain fields and encoded information 
regions. 

In order to facilitate locating regions of form 10 
marked for reading, Le., fields, form 10 includes a 
reference point 16 from which the layout of the 
remainder of the form is calculated. The form interpre- 
ter locates this point, and measures position of the 
contents of the fields to be read therefrom. A conve- 
nient location for reference point 16 is the upper left- 
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hand comer of the form. Thus, the location of any field 
may be described in terms of horizontal and vertical 
displacements from the reference point 

A form according to the present invention will 
include one or more arbitrarfly located field regions, 
which include one or more fields of the type described 
above.These fields may be "check boxes" (or "bub- 
bles") 18. single or multi digited numeric or alpha-nu- 
meric fields, such as the multi digited numeric field 20, 
and/or the multi character alpha field 22. Image fields 
24, La., regions of text, graphics or other information 
scanned and saved as a single image, are yet another 
type of field which may be located on document 10. 
Arbitrary graphics 14 should be distinguished from the 
contents, if any. of image field 24. Whereas the con- 
tents of image field 24 will be maintained as data for 
processing, a form interpreter utflizmg a form such as 
that described herein wiJ ignore the arbitrary graphics 
14. much as it wffl ignore the arbitrary text 12. 

A region of encoded information 26 which repre- 
sents a structural description of form 10, as well as 
other selected information, will be located on the form 
itself. The encoded information contained in region 26 
includes, mtar alia, the complete description of the 
location of the fields on the form which enables arbit- 
rary placement of the fields on the form. In fact region 
26 may also be arbitrarily located. Specifically, region 
26 need not be physically or logically placed on form 
10 with reference to the fields. Rather, in one embo- 
diment, the form interpreter is instructed where on 
form 10 region 26 may be found. Alternatively, the 
form may be searched for region 26, based on data 
type, format, etc. Once located, the information con- 
tained in region 26 may be read by a scanner and 
decoded by appropriate decoding means to provide 
the position information needed to read and process 
the remainder of the form. It is important to note that 
by providing a complete description of the location of 
the fief d on the form itself, the fields may be arbitrary 
located on the form. That is, the fields may be located 
at any position on the form convenient or desired. 

The method of coding the information contained 
in region 26 may be by any convenient machine-read- 
able coding scheme. One example is the so-called 
well known "bar codes". 

The encoded information carried by region 26 
may include a description of any attribute of the form. 
However, at a minimum, the encoded information will 
include a description of the physical location of one or 
more fields on form 10, relative to reference point 16, 
and a description of the type of that one or more fields 
(i.e_, bubble, alpha-numeric, image, etc.) Examples of 
the further types of information which may be carried 
by region 26 are instructions to a processor for speci- 
fic processing of selected data, including data remote 
now form 10, dialing instructions to a facsimie 
machine acting as an interface between the document 
scanner and the form interpreter, network addresses 



for the routing of selected data, data which are to be 
processed, eta The point here is that by providing a 
region of encoded information which ties directly to 
user-modifiable fields, a form may be provided that is 
5 a direct path between user and form interpreter — no 
preprogramming of the form interpreter is required. 
Furthermore, programming of processing apparatus 
may also be accomplished by the encoded infor- 
mation, thus alleviating the need to preprogram that 
10 portion of a data-processing system as weB. 

The above form description has been given with 
respect to a single page form. It will be appreciated 
that the foregoing description applies equally to forms 
of more than one page. That is, each page of a mul- 
15 ti-page form should include a reference point such as 
point 16, at least one field, such as fields 18. 20, 22, 
or 24, and a region of encoded information. However, 
by proper coding of information In region 26, a form 
interpreter may be programmed to recognize pages of 
20 a form carrying less than all of the above. For 
example, a form interpreter may be programmed by 
reading the coded information of region 26 (by 
methods such as the one described further below) to 
skip one or more pages of a multi-page form which do 
25 not contain fields. Ucewise. the locations of fields on 
various pages could be programmed via a single-en- 
coded information region so that each page need not 
carry such a region. 

The description above of the form aspect of the 
30 present invention has been from the point of view of 
a paper document It will be appreciated, however, 
that the nature of the present invention lends itself 
equally well to an entirely electronic form, of the type 
that may be transferred from one portion of a data- 
35 processing system to another, or from one data- 
processing system to another. For example, an 
electronic form 30 may be a structure of digital data 
of the type shown In Fig. 2, including control infor- 
mation region 32, predefined data region 34. user- 
40 modifiable (field) data region 36 and form processing 
and description region 38. Again, the form w9l have 
associated with it one or more fields and encoded 
information allowing a direct path between user and 
form interpreter. Here, however, the encoded infor- 
ms matfon wil be an indication of the location of field data 
in the data structure (i.e., pointer location or the like). 

Another aspect of the present invention is the pro- 
cessing of data by way of a form of the type described 
above. By way of example, a system for the proces- 
50 sing of a form carrying a coded description of the loca- 
tion, type, etc, of user-modifiable fields is shown in 
Fig. 3. According to the embodiment of the invention 
illustrated, the system comprises a scanner 50 of the 
type capable of converting the appearance or image 
55 of a form, such as form 10 of Fig. 1, into an electronic 
representation such as a bitmap or the lice. The elec- 
tronic representation of the form is passed from scan- 
ner 50 to a recognition unit 52 which includes 



6 



9 



EP 0 459 793 A1 



10 



recognition software for converting the bitmap or simi- 
tar representation into elemental textual and graphical 
data blocks to the extent possible. For example, rec- 
ognition software generally can correlate printed 
typographic characters with their ASCII encodings 
with substantial success. AddittonaSy, the recognition 
software is sometimes capable of inferring some or all 
of the page layout features of the form from its bitmap 
representation, thereby allowing identification of par- 
ticular regions of information based on physical loca- 
tion, and capable of making probability-based 
determinations of character type orient of printed text, 
thereby allowing identification of particular regions of 
information based on character type. The aim at first 
pass through the form is to locate the regk>n(s) of 
encoded information and to decode the information to 
allow further processing of the form. This may be done 
in one of at least two different ways. First, the encoded 
information region will be located from the location of 
a reference point and preprogrammed information 
about the location of the encoded region relative to the 
reference point According to this method, recognition 
unit 52 will locate the reference point, 16 on form 10, 
Fig. I. Recognition unit 52 will then pass the reference 
point location information to preprocessing unit 54, 
which will cause displacement information to be cal- 
led from memory device 56, such as a ROM unit, and 
add that displacement information to the reference 
point location information to obtain the beginning 
point of the encoded information region. The second 
method of locating the encoded information region 
assumes that recognition unit 52 is capable of 
uniquely identifying the characters or symbols com- 
prising the encoded information region. In essence, 
recognition unit 52 scans the electronic represen- 
tation of the form for a predefined data type, i.e., the 
specific symbols used for encoding. When the first of 
these glyphs are encountered, the scanner then 
knows, that the encoded information region has been 
located. 

Once located, the encoded information must be 
decoded. This is the role of decoding unit 58. The 
input to decoding unit 58 will be the electronic repre- 
sentation of information encoded by one of the above- 
mentioned encoding schemes. By way of example 
only, such information may include the location and 
type of a field, and a destination in computer memory 
at which to store the data contained In the field. The 
output of decoding unit 58 will be input to processing 
unit 60. The decoded information is men utilized by 
processing unit 60 to determine the physical location 
of the field, and to perform the designated operation 
upon the contents of the field, i.e., copy the contents 
into the specified memory location. It will be 
appreciated at this point that virtually any processing 
of the data contained in the field may be performed by 
processing unit 60. Several different operations may 
be performed on a single field's contents, and a single 



operation may be performed on the contents of sev- 
eral fields. Furthermore, there is no practical upper 
limit on the number of fields that may be processed in 
the manner described above. Thus, although not 

5 shown in Fig. 3, processing unit 60 will generally be 
in communication with peripheral units such as mem- 
ory units, display units, printing units, eta, for 
peripheral processing of selected data. 

As previously mentioned, the present invention 

10 contemplates processing forms entirely In their elec- 
tronic representation. In such cases, there w8J be no 
printed copy of the form, and thus no need for scanner 
50 in a system for processing such a form. Rather, if 
the electronic representation of the form Is compatible 

is with the capabilities of recognition unit 52, the elec- 
tronic representation may be directly passed to recog- 
nition unit 52, and processing of the form is as 
described above. If, however, the electronic repre- 
sentation of the form is in a format incompatible wifli 

20 the capabilities of recognition unit 52, a translation or 
conversion unit (not shown) may be interposed be- 
tween the form source and recognition unit 52. 

Yet another aspect of the present invention is a 
system for the generation of a form which includes 

25 user-modifiable fields and an encoded information 
region carrying information about the processing of 
the form. By way of example, Fig. 4 details such a sys- 
tem 70. According to this aspect of the present Inven- 
tion, a form of the type described above may be 

30 created on a device such as a personal computer 
{P.O.) or work station 72 having a form creation pack- 
age resident thereon or accessible thereto. The P.C. 
or work station will generally be in communication with 
one or more peripheral devices such as a printer 74, 

35 a facsimile machine 76 or another P.C. or work station 
78. 

The system and steps utilized by the P.C. or work 
station 72 for creation of a form is detailed in Fig. 5. 
On a display presenting an image of the form, a user 

40 will select or create one or more fields, by way of a 
field source means, to be carried by the form at step 
80. The field source means may be afield library, user 
interface device such as a keyboard, or other system 
input such as a scanner or the like. After selection or 

45 creation, a field is positionally located on the form at 
step 82, and information about selected attributes of 
the field, such as type, size, location, etc., is stored in 
a form memory at step 84. Similarly, a user may select 
of create and locate arbitrary text and graphics at 

so steps 86, and 88 respectively. 

As a field is selected or created, the user may 
wish to specify an operation or operations to be per- 
formed on later-added contents of the field at 90. 
Operations and/or data independent of the contents 

55 of any field may also be entered at this point Such an 
operation may be symbolically entered (e.g., c=a+b, 
a mathematical relationship where the contents of 
fields a and b are added and the result put in memory 



7 



11 



EP0 459 793 A1 



12 



location c) or logically entered (e.g., rf field x Is tiled- 
in, move the contents of field y to the memory location 
z). Data may be directly entered (e.g., move the value 
"5" into memory location I). The operations and/or 
data may he selected from an operation library, or s 
input on a user interface device such as a keyboard. 
Such operation and/or data wffl also be stored in the 
form memory at 84. 

It may be desirable, when displaying the fields, 
text, graphics, etc., not to include a display of the oper- w 
ations and/or data. Assuming that the operations 
and/or data are not to be displayed, the display func- 
tion 92 may be interposed between the locate func- 
tions 82 and 88, and the form description memory 
function 84. The display function may, of course, be 15 
located elsewhere in the system for the purposes of 
displaying more or less information than fields, text, 
and graphics. 

Virtually simultaneously with the creation of the 
form itself, an encoded form description is created. 20 
The encoded form description may or may not include 
the arbitrary text and graphics, as described above. 
Assuming for illustration purposes that the arbitrary 
text and graphics are not to be encoded, the step of 
encoding the relevant information is shown at step 94, 25 
interposed between the locate and select functions 82 
and 90, and the store in form memory function 84. The 
encoded information is next stored in an encoded des- 
cription memory at step 96. Of course, the encoding 
of step 94 and storage of step 96 may be located else* 30 
where in the system for the purposes of encoding and 
storing more or less information than fields and oper- 
ations. Furthermore, the makeup of the encoding 
apparatus used is a function of the scheme used to 
encode the information. In the schemes described 35 
above a look-up table is generally the interface be- 
tween the uncoded and coded data. However, the 
details of the coding apparatus and scheme are 
beyond the scope of the present invention. The key 
point is that selected information rs passed to a des- 40 
cription encoder which creates an encoded version of 
the information, nearly, if not in parallel with the crea- 
tion of the form itself. 

For the purposes of printing the form or transmit- 
ting the form electronically for use, the form arid its 45 
encoded description must be merged. This is done at 
step 98 by a form composer. The form composer wil 
be preprogrammed with certain instructions about the 
placement of the text, graphics, fields and encoded 
information. For the purposes of a printed form, it is so 
the role of the form composer to ensure that all of the 
relevant Information is fitted onto the printed page. For 
the electronic version of the form, it is the role of the 
form interpreter to locate the various components of 
the form property in the appropriate data structure. 55 
Once composed, the form may be output at 100. The 
form may be printed, sent via facsimile, sent via elec- 
tronic network to P.C.s or workstations, etc., for copy- 



ing, printing, editing or use. 

In the case of a machine acting in the role of user, 
the machine would generate a form with encoded des- 
cription in response to a request, instruction or the like 
in a manner similar to that described above. For 
example, a request may be made via computer or fac- 
simile for information on a particular subject, eliciting 
a response in the form of a printed paper form with 
bubbles to be fflled in representing available specific 
items on the particular subject which may be reques- 
ted. The machine may assemble a form from a library 
of field types, following predefined layout instructions, 
while maintaining a description of the form to be ulti- 
mately encoded and printed on the form or made a 
part of its electronic description. Again, the resulting 
form may be hard-copy or an electronic represen- 
tation of the form. 

The above description has focussed on forms 
carrying encoded information representing the loca- 
tion, type, processing, etc, of the fields associated 
with a form. It is within the scope of the present inven- 
tion, however, that a complete description of all of the 
contents of a form, including arbitrary text and arbit- 
rary graphics, be encoded enabling a complete re-edi- 
ting as appropriate. 

Also, the above description has been from the 
point of view of a form carrying its own encoded des- 
cription for the purposes of processing. It will be 
appreciated, however, that a stand-alone encoded 
description could be used for the programming of a 
form interpreter to handle one or more forms to be 
interpreted. In addition, the encoded data could be or 
include an identifier which identifies the form so that 
the form interpreter may rely on a previously program- 
med description of the form. 

Furthermore, use of individual aspects of the pre- 
sent invention, such as the form itself, have been des- 
cribed only by way of example, and such description 
is not intended to be in any sense limiting. For 
example, although a form may be used by directly 
marking the form with, say a pencil, then scanning the 
form to extract the imparted data, a form according to 
the present invention may also by used by displaying 
the form on a P.C., work station or the lice and enter- 
ing the data by means of an appropriate input device 
such as a keyboard, mouse, etc 

In summary, it will be appreciated that the present 
invention provides a simple and convenient way to 
access or store data, or program a form interpreter for 
the processing of data contained on a form. The pre- 
sent invention makes practicable the use of cus- 
tomized forms or forms used in small numbers, 
reduces the burden of programming the form interpre- 
ter, and reduces the required memory space 
associated with a form interpreter, among other 
benefits, by creating a direct correlation between an 
encoded description and processing of the form and 
the data the form carries. 
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Claims 

1. A form for use in a machine data entry and ret- 
rieval system of the type including a decoding 
section for decoding encoded information carried s 
by the form, comprising: 

a carrier (10) for carrying at least one type 
of indicia, which may be modifiedby a user; 

a field region (18, 20, 22) arbitrarily located 
on the carrier for receiving machine-readable 10 
user-imparted or modified indicia, and 

an encoded information region (26) arbit- 
rarily located on the carrier for receiving machine- 
readable encoded information representing 
selected attributes of the field region. 15 

2. The form of claim 1 , wherein the field region con- 
tains field data at a particular physical location on 
the carrier, and wherein the encoded information 
region contains encoded information represent- 20 
ing the location of the field data, such that the data 
entry and retrieval system, upon decoding the 
encoded information may locate and read the 
field data. 

25 

3. The form of claim 1 or 2, wherein the field region 
contains field data upon which a specified oper- 
ation is to be performed, and the encoded infor- 
mation region contains encoded information 
representing instructions to the data entry and 30 
retrieval system which will result in the operation 
being performed on the data. 

4. A system for reading field data from a printed form 
(10), wherein the form has printed thereon 35 
encoded information representing selected attri- 
butes of the field data, including a complete des- 
cription of the location of the field data, 
comprising: 

a scanner (50) for producing an electronic 40 
representation of the printed form; 

processing means (54) for locating in the 
electronic representation of the printed form the 
encoded information; 

decoding means (58) for decoding the 45 
encoded information, and 

processing means (60) for locating in the 
electronic representation of the printed form the 
field data from the decoded location information. 

50 

5. A system for inputting data to a data processing 
system, comprising: 

a printed form (10) upon which are located 
the data to be input and upon which is located 
encoded information representing selected attri- 55 
butes of the data, including a complete descrip- 
tion of the location of the data; 

a scanner (50) for producing an electronic 



representation of the printed form; 

processing means (54) for locating in the 
electronic representation of the printed form the 
encoded information; 

decoding means (58) for decoding the 
encoded information, and 

processing means (60) for locating in the 
electronic representation of the printed form the 
field data from the decoded location information. 

6. The system of claim 5, wherein the data are of the 
type upon which a specified operation Is to be per- 
formed, and wherein the encoded information 
includes instructions interpretable by the data- 
processing system which win result in the oper- 
ation being performed on the data. 

7. A system for producing a printed form having a 
user-modifiable field region arbitrarily located 
thereon, wherein the form has printed thereon 
encoded information representing selected attri- 
butes of the field region, comprising: 

means for providing a user selected field 
having user selected attributes; 

means for positioning the selected field at 
an arbitrary position on the form; 

means for providing an encoded descrip- 
tion of the user-selected field; 

means for providing an encoded descrip- 
tion of the user-selected attributes; 

encoding means for providing an encoded 
description of the user-selected position of the 
selected field, and 

composing means for composing a form, 
the form including the selected field and encoded 
information representing the selected attributes, 
and position of the selected field. 

8. The system of claim 7, further including an oper- 
ation source for providing a user-selectable oper- 
ation to be applied to the contents of the selected 
field, and an encoding means for providing an 
encoded identification of the selected operation, 
and wherein the form composing means com- 
poses a form-including the selected field and 
encoded information representing the selected 
attributes, operation to be applied, and posffion of 
the selected field. 

9. A system for producing a printed form having at 
least one user-modifiable field region arbitrarily 
located thereon, wherein the form has printed the- 
reon encoded information representing selected 
attributes of the at least one field region, compris- 
ing: 

a field library containing one or more user- 
selectable fields; 

an encoded description library containing 
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an encoded description of the attributes of each 
of the user-selectable fields contained In the field 
library; 

type selection means for selecting one or 
more fields from the field library; 5 

position selecting means for positioning 
the selected one or more fields at arbitrary loca- 
tions on the form; 

encoding means for generating an 
encoded description of the selected position of 10 
the selected one or more fields, and 

form composing means for composing a 
form, the form including at least one selected field 
and encoded information representing the selec- 
ted attributes, and position of the or each selected 15 
Held. 

10. The system of claim 9, further including an oper- 
ation library containing one or more user-select- 
able operations to be applied to the contents of a 20 
selected field, and an encoded operation library 
containing an encoded identification of the one or 
more user-selectable operations, and wherein 
the form composing means composes a form 
including the or each selected field and encoded 25 
information representing the selected attributes, 
operation to be applied, and position of the or 
each selected field. 
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